OCR exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction

Internal identifier: 000674 (Main/Exploration); previous: 000673; next: 000675

Authors: Il Koo [South Korea]; Ik Cho [South Korea]

Source: Lecture Notes in Computer Science (2010)

RBID : ISTEX:CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A

Abstract

This paper proposes a new approach to estimating document states such as interline spacing and text line orientation, which facilitates a number of tasks in document image processing. The proposed method can be applied to spatially varying states as well as invariant ones, so that general cases, including images with complex layouts, camera-captured images, and handwritten documents, can also be handled. Specifically, we find connected components (CCs) in a document image and assign a state to each of them. The states of the CCs are then estimated using an energy minimization framework, where the cost function is designed based on frequency-domain analysis and minimized via graph cuts. Using the estimated states, we also develop a new algorithm that performs text block identification and text line extraction. Roughly speaking, we segment an image into text blocks by cutting the connections among CCs that are distant relative to the estimated interline spacing, and we group the CCs into text lines using bottom-up grouping along the estimated text line orientation. Experimental results on a variety of document images show that our method is efficient and provides promising results in several document image processing tasks.
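
The grouping step described in the abstract can be illustrated with a small sketch. This is not the authors' method: the energy minimization and graph-cut estimation are not reproduced, and the orientation `theta` and interline spacing `spacing` are simply assumed to be known already. The function names (`connected_components`, `centroid`, `group_into_lines`), the half-spacing threshold, and the toy image are all hypothetical choices made for illustration.

```python
from collections import deque
import math

def connected_components(img):
    """Return the 4-connected foreground (value 1) components as lists of (y, x) pixels."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if img[y][x] != 1 or seen[y][x]:
                continue
            queue, pixels = deque([(y, x)]), []
            seen[y][x] = True
            while queue:  # breadth-first flood fill from the seed pixel
                cy, cx = queue.popleft()
                pixels.append((cy, cx))
                for ny, nx in ((cy + 1, cx), (cy - 1, cx), (cy, cx + 1), (cy, cx - 1)):
                    if 0 <= ny < h and 0 <= nx < w and img[ny][nx] == 1 and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            comps.append(pixels)
    return comps

def centroid(pixels):
    """Mean (y, x) position of a component."""
    n = len(pixels)
    return (sum(p[0] for p in pixels) / n, sum(p[1] for p in pixels) / n)

def group_into_lines(centroids, theta, spacing):
    """Group CCs whose offsets perpendicular to the line direction are close.

    theta is the assumed text line orientation in radians, spacing the assumed
    interline spacing in pixels. The half-spacing threshold is an illustrative choice.
    """
    # Signed distance of each centroid along the normal of the line direction.
    offsets = sorted(
        (cy * math.cos(theta) - cx * math.sin(theta), i)
        for i, (cy, cx) in enumerate(centroids)
    )
    lines = []
    for off, i in offsets:
        if lines and off - lines[-1]["last"] < spacing / 2:
            lines[-1]["members"].append(i)
            lines[-1]["last"] = off
        else:
            lines.append({"members": [i], "last": off})
    return [sorted(line["members"]) for line in lines]

# Toy binary image: two horizontal rows of three blobs each.
image = [
    [1, 1, 0, 1, 0, 1, 1],
    [1, 1, 0, 1, 0, 1, 1],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0, 0, 1],
    [1, 0, 1, 1, 0, 0, 1],
]
comps = connected_components(image)
lines = group_into_lines([centroid(c) for c in comps], theta=0.0, spacing=5.0)
print(lines)  # [[0, 1, 2], [3, 4, 5]]
```

In the paper's setting, each CC would carry its own estimated state, so the orientation and spacing could vary across the page; the sketch uses a single global pair only to keep the example short.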

Url: https://api.istex.fr/document/CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A/fulltext/pdf
DOI: 10.1007/978-3-642-15552-9_31


Affiliations: Seoul National University, Seoul, South Korea



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction</title>
<author>
<name sortKey="Koo, Il" sort="Koo, Il" uniqKey="Koo I" first="Il" last="Koo">Il Koo</name>
</author>
<author>
<name sortKey="Cho, Ik" sort="Cho, Ik" uniqKey="Cho I" first="Ik" last="Cho">Ik Cho</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A</idno>
<date when="2010" year="2010">2010</date>
<idno type="doi">10.1007/978-3-642-15552-9_31</idno>
<idno type="url">https://api.istex.fr/document/CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001220</idno>
<idno type="wicri:Area/Istex/Curation">001146</idno>
<idno type="wicri:Area/Istex/Checkpoint">000254</idno>
<idno type="wicri:doubleKey">0302-9743:2010:Koo I:state:estimation:in</idno>
<idno type="wicri:Area/Main/Merge">000679</idno>
<idno type="wicri:Area/Main/Curation">000674</idno>
<idno type="wicri:Area/Main/Exploration">000674</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction</title>
<author>
<name sortKey="Koo, Il" sort="Koo, Il" uniqKey="Koo I" first="Il" last="Koo">Il Koo</name>
<affiliation wicri:level="4">
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Corée du Sud</country>
</affiliation>
</author>
<author>
<name sortKey="Cho, Ik" sort="Cho, Ik" uniqKey="Cho I" first="Ik" last="Cho">Ik Cho</name>
<affiliation wicri:level="4">
<country>Corée du Sud</country>
<placeName>
<settlement type="city">Séoul</settlement>
</placeName>
<orgName type="university">Université nationale de Séoul</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Corée du Sud</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2010</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
</series>
<idno type="istex">CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A</idno>
<idno type="DOI">10.1007/978-3-642-15552-9_31</idno>
<idno type="ChapterID">31</idno>
<idno type="ChapterID">Chap31</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper proposes a new approach to the estimation of document states such as interline spacing and text line orientation, which facilitates a number of tasks in document image processing. The proposed method can be applied to spatially varying states as well as invariant ones, so that general cases including images of complex layout, camera-captured images, and handwritten ones can also be handled. Specifically, we find CCs (Connected Components) in a document image and assign a state to each of them. Then the states of CCs are estimated using an energy minimization framework, where the cost function is designed based on frequency domain analysis and minimized via graph-cuts. Using the estimated states, we also develop a new algorithm that performs text block identification and text line extraction. Roughly speaking, we can segment an image into text blocks by cutting the distant connections among the CCs (compared to the estimated interline spacing), and we can group the CCs into text lines using a bottom-up grouping along the estimated text line orientation. Experimental results on a variety of document images show that our method is efficient and provides promising results in several document image processing tasks.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Corée du Sud</li>
</country>
<settlement>
<li>Séoul</li>
</settlement>
<orgName>
<li>Université nationale de Séoul</li>
</orgName>
</list>
<tree>
<country name="Corée du Sud">
<noRegion>
<name sortKey="Koo, Il" sort="Koo, Il" uniqKey="Koo I" first="Il" last="Koo">Il Koo</name>
</noRegion>
<name sortKey="Cho, Ik" sort="Cho, Ik" uniqKey="Cho I" first="Ik" last="Cho">Ik Cho</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000674 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000674 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:CD4F13A6EEDBE2D22CEBF518F7B19A9D69FE1D5A
   |texte=   State Estimation in a Document Image and Its Application in Text Block Identification and Text Line Extraction
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024